Feedback Control of a Chemical Mechanical Polishing Device
Providing Manipulation of Removal Rate Profiles

Related Applications 

This application claims priority from application serial number 60/298,878, filed June 19, 2001
and entitled "Advanced Process Control for Semiconductor Manufacturing Process," and from
application serial number 09/943,955, filed August 31, 2001 and entitled "Feedback Control of a
Chemical Mechanical Polishing Device Providing Manipulation of Removal Rate Profiles," both of
which are incorporated herein by reference.

Field of the Invention

The present invention generally relates to the planarization of substrates, in particular,
semiconductor wafers, and more particularly to a method and apparatus for providing feedback control
of the planarization process.

Background of the Invention

Chemical-mechanical polishing (CMP) is used in semiconductor fabrication processes for
obtaining full planarization of a semiconductor wafer. The method involves removing material, e.g., a
sacrificial layer of surface material, from the wafer (typically, silicon dioxide (SiO2)) using mechanical
contact and chemical erosion. Polishing flattens out height differences, since areas of high topography
(hills) are removed faster than areas of low topography (valleys).

CMP typically utilizes an abrasive slurry dispersed in an alkaline or acidic solution to planarize
the surface of the wafer through a combination of mechanical and chemical action. Generally, a CMP
tool includes a polishing device (having an attached wafer to be polished) positioned above a rotatable
circular platen on which a polishing pad is mounted. In use, the platen may be rotated and an abrasive
slurry is introduced onto the polishing pad. Once the slurry has been applied to the polishing pad, a
downward force may be applied to a rotating head to press the attached wafer against the pad. As the
wafer is pressed against the polishing pad, the wafer is mechanically and chemically polished.

The effectiveness of a CMP process may be measured by its polishing rate, and by the resulting
finish (absence of small-scale roughness) and flatness (absence of large-scale topography) of the
substrate surface. The polishing rate, finish and flatness are determined by a variety of factors,
including the pad and slurry combination, the relative speed between the substrate and pad and the force
pressing the substrate against the polishing pad.

As semiconductor processes are scaled down, the importance of CMP to the fabrication process
increases. In particular, it is increasingly important to control and minimize within wafer (WIW)
thickness non-uniformity. A variety of factors may contribute to producing variation across the surface
of a wafer during polishing. For example, variations in the surface topography may be attributed to
drift of the processing conditions in the CMP device. Typically, the CMP device is optimized for a
particular process, but because of chemical and mechanical changes to the process, e.g., changes in the
polishing pad during polishing, degradation of process consumables, and other factors, the CMP
process may drift from its optimized state. In addition to processing drift, the wafer surface coming
into the CMP process may be non-uniform, which exacerbates the process-induced variations across the
post-polished surface.

Recent attempts to correct processing drift include feedback control, in which information
generated during current processing is used to adjust future processing runs. One control variable used
in such feedback control of the polishing step includes the arm oscillation length of the polishing tool.
Feedback loops have also been developed for optimization of polishing pad conditioning. However,
these schemes are still not adequate in today's manufacturing environment to satisfactorily compensate
for the aforementioned effects.

The present invention is directed to overcoming, or at least reducing the effects of, one or more
of the problems set forth above.

Summary of the Invention

The present invention relates to a method, apparatus and medium for planarizing a surface of a
substrate, for example, a semiconductor wafer, in order to improve run-to-run control over the wafer
thickness profile. The present invention uses a model (which can be implemented as a single model or
multiple models) of the planarization process to predict material removal across the wafer surface and
to improve within wafer thickness uniformity. Deviations from the predicted outcome are used to set
new polishing parameters, which feed back into the process to enhance process results.

In one aspect of the present invention, a method of producing a uniform wafer thickness profile
in a polishing operation includes (a) providing a model for a wafer polishing that defines a plurality of
regions on a wafer and identifies a wafer material removal rate in a polishing step for each of the
regions, and (b) polishing a wafer using a polishing recipe that generates a target thickness profile for
each region.

In another aspect of the present invention, a method of controlling surface non-uniformity of a
wafer in a polishing operation includes (a) providing a model for a wafer polishing that defines a
plurality of regions on a wafer and identifies a wafer material removal rate in a polishing step of a
polishing process for each of the regions, wherein the polishing process includes a plurality of polishing
steps, (b) polishing a wafer using a first polishing recipe based upon an incoming wafer thickness
profile, (c) determining a wafer thickness profile for the post-polished wafer of step (b), and (d)
calculating an updated polishing recipe based upon the wafer thickness profile of step (c) and the model
of step (a) to maintain a target wafer thickness profile.
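Steps (b) through (d) amount to a run-to-run loop. The following is a minimal sketch only, assuming a single region, a single polishing step, wafers that arrive thicker than the target, and a removal-rate estimate that is blended with each newly observed rate. The function names, the blending weight and the simulated tool are hypothetical and are not taken from this application.

```python
def run_to_run(incoming_thicknesses, target, rate_estimate, polish, weight=0.5):
    """Polish wafers one after another, updating the recipe after each run.

    polish(incoming, polish_time) stands in for the tool plus metrology and
    returns the measured post-polish thickness (hypothetical interface).
    """
    results = []
    for incoming in incoming_thicknesses:
        # Recipe: the time the current model predicts will reach the target.
        polish_time = (incoming - target) / rate_estimate
        measured = polish(incoming, polish_time)
        results.append(measured)
        # Feedback: blend the observed removal rate into the model estimate.
        observed_rate = (incoming - measured) / polish_time
        rate_estimate = weight * observed_rate + (1.0 - weight) * rate_estimate
    return results, rate_estimate


def simulated_tool(incoming, polish_time, true_rate=9.0):
    """Toy stand-in for a real tool: removes material at a fixed true rate."""
    return incoming - true_rate * polish_time
```

With an initial estimate that is too high, the first wafer is under-polished and the corrected estimate brings the next wafer closer to the target.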

In at least some embodiments of the present invention, the first polishing recipe is based on the
model of step (a) to obtain the target wafer thickness profile, or the first polishing recipe is determined
empirically.

In at least some embodiments of the present invention, the plurality of regions in the model of
step (a) includes regions extending radially outward from a center point on the wafer. The model may
include four or more regions.

In at least some embodiments of the present invention, the polishing of step (b) includes
polishing the wafer at a plurality of polishing stations. The polishing step may be carried out at three
polishing stations.

In at least some embodiments of the present invention, the polishing recipe is the same at at 
least two polishing stations. 

In at least some embodiments of the present invention, the polishing recipe is different at at
least two polishing stations.

In at least some embodiments of the present invention, calculating the updated polishing recipe 
of step (c) includes calculating updated polishing recipes for each of the plurality of polishing stations. 

In at least some embodiments of the present invention, the updated polishing recipes for each of
the plurality of polishing stations account for the tool state of the individual polishing stations. The
wafer thickness profile for each of the subsequent polishing stations may be provided by the prediction
from previous stations.

In at least some embodiments of the present invention, the step of providing a model includes
(e) measuring pre-polished wafer thickness in each of a plurality of regions defined on one or more
wafers, (f) polishing the one or more wafers, wherein polishing includes polishing the one or more
wafers in a plurality of polishing steps, (g) measuring the wafer material removal rate for the one or
more wafers at each of the plurality of regions after each of the polishing steps of step (f), (h) providing
a model defining the effect of tool state on polishing effectiveness, and (i) recording the pre-polished
and post-polished wafer thicknesses for each of the regions on a recordable medium. The model may
further include fitting the data to a linear or non-linear curve that establishes a relationship between the
material removal rate of a region of the wafer and a polishing parameter of interest.

In at least some embodiments of the present invention, the polishing parameter includes polishing
time. The polishing parameters may further include a parameter selected from the group consisting of
polishing time, polishing pad down forces and velocity, slurry flow and composition, conditioning
time, conditioning disk down forces and velocity, and oscillating speeds of both the conditioning disk
and the wafer carrier.

In at least some embodiments of the present invention, wafer removal for a region j ($\Delta R'_j$) in
the model of step (a) is determined according to the equation:

$$\Delta R'_j = (c_{11j} x_1 + c_{12j})\,t_1 + (c_{21j} x_2 + c_{22j})\,t_2 + (c_{31j} x_3 + c_{32j})\,t_3 + (c_{41j} x_4 + c_{42j})\,t_4 + (c_{51j} x_5 + c_{52j})\,t_5$$

where $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$ are the additional parameter values for polishing steps 1, 2, 3, 4, and
5, respectively; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are the polishing times for polishing steps 1, 2, 3, 4, and 5,
respectively; $c_{a1j}$ provides the contribution to wafer removal of the variable x in polishing step a in
region j; and $c_{a2j}$ provides the contribution to wafer removal of polishing time in polishing step a. The
wafer material removal rate profile may account for tool state by scaling the profile using the scaling
factor:

$$1 + k_p t_p + k_d t_d + k_{pd}\, t_p t_d$$

where the terms $t_p$ and $t_d$ refer to pad and disk life, respectively, with units of hour; and the
terms $k_p$, $k_d$ and $k_{pd}$ are empirically determined coefficients relating pad and disk life to removal rate.

In at least some embodiments of the present invention, an updated polishing recipe is attained
by solving the equation:

$$\min_{x} f\left(y^{sp}, g(x)\right)$$

where x is a vector of times and other processing parameters corresponding to the polishing recipe; g(x)
is the model for the polishing process; $y^{sp}$ is a vector of the desired average region wafer thicknesses;
and $f(y^{sp}, g(x))$ is a penalty function to penalize the deviation between the model predictions g(x) and
the desired thicknesses $y^{sp}$.
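One way to picture this minimization is sketched below. It assumes a quadratic penalty, a model that is linear in the step times only (predicted thickness equals incoming thickness minus the sum of rate times time), and a plain projected gradient descent that keeps the times non-negative. None of these choices is stated in this application; the function and argument names are hypothetical.

```python
def optimize_step_times(incoming, target, rates, iterations=500):
    """Find non-negative step times minimizing 0.5 * sum((g(x) - y_sp)**2).

    incoming[j]: incoming thickness of region j
    target[j]:   desired thickness of region j (the setpoint vector)
    rates[a][j]: removal rate of polishing step a in region j
    """
    n_steps = len(rates)
    n_regions = len(incoming)
    times = [0.0] * n_steps
    # A step size of 1 / (sum of squared rates) keeps the descent stable.
    step_size = 1.0 / sum(c * c for row in rates for c in row)
    for _ in range(iterations):
        # Model prediction g(x) for the current recipe.
        predicted = [
            incoming[j] - sum(rates[a][j] * times[a] for a in range(n_steps))
            for j in range(n_regions)
        ]
        error = [predicted[j] - target[j] for j in range(n_regions)]
        # Descent update; the gradient for step a is -sum_j rates[a][j] * error[j].
        times = [
            max(0.0, times[a] + step_size * sum(rates[a][j] * error[j] for j in range(n_regions)))
            for a in range(n_steps)
        ]
    return times
```

A production controller would more likely hand the same objective to a constrained solver; the loop here only makes the roles of x, g(x) and the penalty concrete.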

In another aspect of the present invention, a method of determining a model for wafer thickness
profile includes (a) measuring pre-polished wafer thickness in each of a plurality of regions defined on
one or more wafers, (b) polishing the one or more wafers, wherein polishing includes polishing the one
or more wafers in a plurality of polishing steps, (c) measuring the wafer material removal rate for the
one or more wafers at each of the plurality of regions after each of the polishing steps of step (b), (d)
providing a model defining the effect of tool state on polishing effectiveness, and (e) recording the
pre-polished and post-polished wafer thicknesses for each of the regions on a recordable medium. The
model may include fitting the data to a linear or non-linear curve that establishes a relationship between
the material removal rate of a region of the wafer and a polishing parameter of interest.

In at least some embodiments of the present invention, the polishing parameter includes
polishing time. The polishing parameters may include a parameter selected from the group consisting
of polishing time, polishing pad down forces and velocity, slurry flow and composition, conditioning
time, conditioning disk down forces and velocity, and oscillating speeds of both the conditioning disk
and the wafer carrier.

In at least some embodiments of the present invention, the wafer material removal for a region j
($\Delta R'_j$) in the model of step (a) is determined according to the equation:

$$\Delta R'_j = (c_{11j} x_1 + c_{12j})\,t_1 + (c_{21j} x_2 + c_{22j})\,t_2 + (c_{31j} x_3 + c_{32j})\,t_3 + (c_{41j} x_4 + c_{42j})\,t_4 + (c_{51j} x_5 + c_{52j})\,t_5$$

where $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$ are the additional parameter values for polishing steps 1, 2, 3, 4, and
5, respectively; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are the polishing times for polishing steps 1, 2, 3, 4, and 5,
respectively; $c_{a1j}$ provides the contribution to wafer removal of the variable x in polishing step a in
region j; and $c_{a2j}$ provides the contribution to wafer removal of polishing time in polishing step a. The
wafer material removal rate profile may account for tool state by scaling the profile using the scaling
factor:

$$1 + k_p t_p + k_d t_d + k_{pd}\, t_p t_d$$

where the terms $t_p$ and $t_d$ refer to pad and disk life, respectively, with units of hour; and the
terms $k_p$, $k_d$ and $k_{pd}$ are empirically determined coefficients relating pad and disk life to removal rate.

In at least some embodiments of the present invention, the model is determined using fewer
than 10 wafers.

In one aspect of the present invention, an apparatus for conditioning polishing pads used to
planarize substrates is provided having a carrier assembly having a plurality of arms for holding a wafer
positionable over a plurality of planarizing surfaces of a plurality of polishing pads, controlling means
capable of controlling an operating parameter of the polishing process, and a controller operatively
coupled to the controlling means, the controller operating the controlling means to adjust the operating
parameter of the polishing process as a function of a model for a wafer thickness profile, the model
including a polishing model that defines a plurality of regions on a wafer and identifies a wafer
material removal rate in a polishing step of a polishing process for each of the regions, wherein the
polishing process includes a plurality of polishing steps.

In at least some embodiments of the present invention, the model defines wafer removal for a
region j ($\Delta R'_j$) in the wafer material removal rate model according to the equation:

$$\Delta R'_j = (c_{11j} x_1 + c_{12j})\,t_1 + (c_{21j} x_2 + c_{22j})\,t_2 + (c_{31j} x_3 + c_{32j})\,t_3 + (c_{41j} x_4 + c_{42j})\,t_4 + (c_{51j} x_5 + c_{52j})\,t_5$$

where $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$ are the additional parameter values for polishing steps 1, 2, 3, 4, and
5, respectively; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are the polishing times for polishing steps 1, 2, 3, 4, and 5,
respectively; $c_{a1j}$ provides the contribution to wafer removal of the variable x in polishing step a in
region j; and $c_{a2j}$ provides the contribution to wafer removal of polishing time in polishing step a.

In another aspect of the present invention, a computer readable medium includes instructions
being executed by a computer, the instructions including a computer-implemented software application
for a chemical mechanical polishing process, and the instructions for implementing the process include
(a) receiving data from a chemical mechanical polishing tool relating to the wafer removal rate of at
least one wafer processed in the chemical mechanical polishing process, and (b) calculating, from the
data of step (a), an updated polishing recipe, wherein the updated polishing recipe is calculated by
determining the difference between an output of a wafer material removal rate model and the data of
step (a).

In at least some embodiments of the present invention, the model for a wafer material removal
rate defines a plurality of regions on a wafer and identifies a wafer material removal rate in a polishing
step of a polishing process for each of the regions, wherein the polishing process includes a plurality of
polishing steps.

In at least some embodiments of the present invention, the wafer removal for a region j ($\Delta R'_j$)
in the wafer material removal rate model is determined according to the equation:

$$\Delta R'_j = (c_{11j} x_1 + c_{12j})\,t_1 + (c_{21j} x_2 + c_{22j})\,t_2 + (c_{31j} x_3 + c_{32j})\,t_3 + (c_{41j} x_4 + c_{42j})\,t_4 + (c_{51j} x_5 + c_{52j})\,t_5$$

where $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$ are the additional parameter values for polishing steps 1, 2, 3, 4, and
5, respectively; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are the polishing times for polishing steps 1, 2, 3, 4, and 5,
respectively; $c_{a1j}$ provides the contribution to wafer removal of the variable x in polishing step a in
region j; and $c_{a2j}$ provides the contribution to wafer removal of polishing time in polishing step a.

The term "target wafer thickness profile" represents the desired processing outcome of the
CMP process. Some tolerance is built into the profile, so that a feedback control system defines a target
profile and acceptable standard deviations therefrom, wherein such deviations would not require
updating of the polishing recipe. Use of the term target wafer thickness profile includes the target and
the standard deviation therefrom.
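The tolerance band can be read as a simple gate on recipe updates. The sketch below assumes one allowed deviation applied to every region; the function name and the single-number tolerance are illustrative assumptions, not definitions from this application.

```python
def needs_recipe_update(measured, target, tolerance):
    """Return True if any region deviates from its target by more than tolerance.

    measured[j] and target[j] are region-averaged thicknesses in the same
    units as tolerance (hypothetical interface).
    """
    return any(abs(m - t) > tolerance for m, t in zip(measured, target))
```

A profile that stays inside the band leaves the current polishing recipe unchanged.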

The term wafer is used in a general sense to include any substantially planar object that is
subject to polishing. Wafers include, in addition to monolith structures, substrates having one or
more layers or thin films deposited thereon. Throughout the specification, wafer and thin film may be
used interchangeably, unless otherwise indicated.

"Tool state" refers to the condition of the consumable or variable components of the CMP
apparatus. Most often this term is used to refer to the state of the conditioning disk and polishing pad,
which change continually over the lifetime of the pads, and idle time. Typical conditioning disk life is
about 60 hours and typical polishing pad life is about 30 hours.

Brief Description of the Drawings

Various objects, features, and advantages of the present invention can be more fully appreciated
with reference to the following detailed description of the invention when considered in connection
with the following drawings, in which like reference numerals identify like elements. The following
drawings are for the purpose of illustration only and are not intended to be limiting of the invention, the
scope of which is set forth in the claims that follow.

FIG. 1 is a perspective view of a chemical mechanical polishing apparatus.

FIG. 2 is a plot of oxide material removal (Å) across the surface of a substrate for successive
polishing steps in a polishing recipe.

FIG. 3 is a flow diagram generally illustrating model development.

FIG. 4 is a schematic illustration of a wafer showing regions defined for the thickness profile
model.

FIG. 5 is a flow diagram of the feedback loop used in a CMP polishing operation, as
contemplated by at least some embodiments of the present invention.

FIG. 6 is a schematic illustration of model development for a CMP process using two platens
with different polishing recipes, as contemplated by at least some embodiments of the present
invention.

FIG. 7 is a block diagram of a computer system that includes tool representation and access
control for use in at least some embodiments of the invention.

FIG. 8 is an illustration of a floppy disk that may store various portions of the software
according to at least some embodiments of the invention.

Detailed Description of the Invention

FIG. 1 shows a perspective view of a typical apparatus 100 for polishing one or more
substrates 110. The CMP apparatus 100 includes a series of polishing stations 101 and a transfer
station 102 for loading and unloading substrates. Each polishing station includes a rotatable platen 103
on which is placed a polishing pad 104. A source of polishing fluid 111 may be provided to supply
polishing fluid 112 to the polishing pad 104. Each polishing station may include an associated pad
conditioning apparatus 105 to maintain the abrasive condition of the polishing pad. A rotatable
multi-head carousel 106 is supported by center post 107 about which the carousel rotates. The carousel 106
includes multiple carrier heads 108, each capable of independently rotating about its own axis. The
carrier head 108 receives a substrate from and delivers a substrate to the transfer station 102. The
carrier head provides a controllable load, i.e., pressure on the substrate to push it against the polishing
pad when the polishing station and the carrier head are engaged. Some carrier heads include a retaining
ring 109 to hold the substrate and help to provide the polishing load. To effectuate polishing, the platen
103 may be rotated (typically at a constant speed). Moreover, individually variable down forces may
be applied by each of the carrier heads 108, for example by adjusting retaining ring pressures. The
carrier heads 108 holding substrates 110 can rotate on axis 113 and oscillate back and forth in slot 114.

One type of CMP process polishes the wafer in a series of polishing steps. By way of example,
FIG. 2 shows a CMP profile for eight successive polishing steps 201 through 208 for a single wafer 200
mm in diameter. Each polishing step removes a subset of the total material to be polished from the
substrate surface. Moreover, the thickness profile generated by each polishing step may be different, as
is seen by comparison of profiles 201 and 208. The final, thin film thickness profile is the sum of the
individual polishing step thickness profiles and desirably produces a uniform wafer thickness across the
surface.

A CMP process may include the transport of a sample from polishing station (platen) to
polishing station (platen). One type of CMP process distributes wafer removal among the various
platens, and each platen will have a full set of polishing steps to achieve the desired material removal
for that platen. Any combination of removal is possible. Thus, by way of example, where it is desired
to remove 6000 Å of material in total, 3000 Å may be removed at platen 1, 1000 Å may be removed at
platen 2, and 2000 Å may be removed at platen 3. The polishing recipe for each
platen may be the same or different.
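The split of removal across platens can be tracked as a running thickness profile. The short sketch below assumes per-region removal amounts are already known for each platen; the function name and the two-region wafer are hypothetical, and the 3000/1000/2000 Å split is the example from the text.

```python
def thickness_after_each_platen(incoming, removal_per_platen):
    """Return the region thickness profile after each platen in sequence.

    incoming[j]:              incoming thickness of region j (angstroms)
    removal_per_platen[p][j]: material removed at platen p in region j
    """
    profile = list(incoming)
    history = []
    for removal in removal_per_platen:
        # The output of one platen is the input to the next.
        profile = [h - r for h, r in zip(profile, removal)]
        history.append(profile)
    return history


# Example from the text: 6000 angstroms total, split 3000 / 1000 / 2000.
profiles = thickness_after_each_platen(
    [10000.0, 10000.0],
    [[3000.0, 3000.0], [1000.0, 1000.0], [2000.0, 2000.0]],
)
```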

The CMP processes described above may be modeled to provide a format for improving the
planarization process. The model can be represented as raw data that reflects the system, or it can be
represented by equations, for example multiple input-multiple output linear, quadratic and non-linear
equations, which describe the relationship among the variables of the system. By using a model, the
within wafer thickness uniformity can be improved or maintained run-to-run by adjusting the polishing
parameters during wafer polishing to correct for unmodeled effects or to correct for drift in the
polishing process conditions. By way of example, polishing time, polishing pad down forces and
velocity, slurry flow and composition, conditioning time, conditioning disk down forces and velocity,
and oscillating speeds of both the conditioning disk and the wafer carrier may be adjusted during the
polishing operation in a feedback and feedforward loop that predicts and then optimizes the polishing
recipe.

According to at least some embodiments of the present invention, an initial model is developed
based upon knowledge of the wafer polishing process, as is shown in a flow diagram (FIG. 3). An
initial understanding of the system is acquired in step 300, which is used to design and run a design of
experiments (DOE) of step 310. The DOE desirably is designed to establish the relationship between or
among variables that have a strong and predictable impact on the processing output one wishes to
control, e.g., wafer thickness. The DOE provides data relating to process parameters and process
outcome, which is then loaded to the advanced process control system in step 320. The advanced
process control system may be a controller or computer that uses the data to create and update the
model. Processing requirements such as output targets and process specification are determined by the
user in step 325, which are combined with the DOE data to generate a working model in step 330.

An illustrative example of model development is now described. According to at least some
embodiments of the present invention, a model structure is defined that models wafer material removal
rate (polishing) profiles as independent steps in the CMP process. As described herein above (FIG. 2),
the individual steps may be combined to produce a uniform, final wafer thickness. The steps to be used
in the model can also be defined as subsets of removal rate profiles; that is, a step may consist of a
family of removal rate profiles that have similar characteristics. For each family of removal rate
profiles, polishing parameters are identified, which may be varied, and their effect on the outcome is
determined. Exemplary polishing variables, which may be included in this model include, but are not
limited to, polishing time, polishing pad down forces and velocity, slurry flow and composition,
conditioning time, conditioning disk down forces and velocity, and sweep speeds of both the
conditioning disk and the wafer carrier.

In at least some embodiments of the present invention, the model relies on removal rate profiles
based on regions of the wafer. As is shown in FIG. 4, a wafer may be divided into radial regions 401
through 405 of varying width and area. The number of regions is not set for the model and may be
selected based upon the polishing profile. Thus, for example, FIG. 2 designates seven (7) regions
across the wafer, while FIG. 4 illustrates five (5). The size and location of the regions also may vary
and may be selected based upon the effect of certain polishing parameters on the wafer in that region.

The number, size and location of regions may be selected based upon the complexity of the
wafer material removal rate profile. In at least some embodiments, it is desirable that the profile in any
given region be substantially uniform, particularly in those cases where a number of wafer thickness
measurements within a region are averaged to define the region-averaged thickness. Thus, at the
edges where edge effects can be dramatic, narrow regions encompassing only the outer regions may be
selected. Near the center of the wafer where polishing effects may be more subtle, a larger region may
be defined. The regions are defined such that all azimuthal variation is averaged out since the CMP tool
cannot correct for such variation. Film thickness measurements taken within a region of the wafer are
averaged to give the average thickness for that region.
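Region averaging can be sketched as binning measurement sites by radius, which discards the azimuthal position of each site. The data layout, the function name and the region boundaries below are hypothetical; only the idea of averaging all measurements that fall inside a radial region comes from the text.

```python
from bisect import bisect_left


def region_averages(measurements, outer_radii):
    """Average thickness measurements within concentric radial regions.

    measurements: list of (radius_mm, thickness) pairs; the angle of each
                  site is ignored, so azimuthal variation is averaged out.
    outer_radii:  ascending outer radius of each region, in mm.
    Returns one average per region, or None for a region with no sites.
    """
    sums = [0.0] * len(outer_radii)
    counts = [0] * len(outer_radii)
    for radius, thickness in measurements:
        # Index of the first region whose outer radius is >= this radius.
        index = bisect_left(outer_radii, radius)
        if index == len(outer_radii):
            raise ValueError("measurement lies outside the outermost region")
        sums[index] += thickness
        counts[index] += 1
    return [s / c if c else None for s, c in zip(sums, counts)]
```

A wide inner region and a narrow edge region, as the text suggests, are expressed simply through unevenly spaced entries in `outer_radii`.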

To obtain DOE data, a polishing step is run and, based upon incoming measurements, e.g.,
pre-polishing and post-polishing wafer thickness measurements, and processing parameter values, a
removal rate profile or, equivalently, a wafer thickness profile, can be determined for each region.
Conventionally, data may be acquired empirically, by carrying out a series of experiments over a range
of parameter values and over the lifetime of the polishing pad and conditioning disk. Such an approach
makes no assumptions about the processing characteristics of the polishing operation, and the data is fit
to the appropriate curve to define the model. This approach requires a large number of wafers, at least
30 for a 4 step process, and is time consuming (a typical disk life is about 60 hours).

In at least some embodiments of the present invention, a modified approach to obtaining DOE
data is used. The approach assumes that the data may be fit to a linear curve and that superposition is
valid. Superposition assumes that the same results are attained by performing a first step for a set time,
followed by performing a second step for a set time, e.g., separately, but sequentially, as are attained by
running the two steps together. In addition, the approach uses an established model to relate pad and
disk life to polishing performance. These assumptions significantly reduce the amount of data (and
hence number of samples) required to model the system appropriately. In at least some embodiments of
the present invention, it is sufficient to run fewer than 10, and even 6-8, wafers for proper model
development. By way of example only, the DOE may include 5-7 polishing steps and the polishing
recipe may be carried out on a few wafers, as few as one, or for example 5-8 wafers. More wafers are
required for polishing recipes with more polishing steps.
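Under the linearity assumption, the removal rate of one polishing step in each region can be estimated from a handful of wafers. The sketch below uses an ordinary least-squares line through the origin (removed material is approximately rate times time); that particular fitting choice, the function name and the data layout are assumptions made for illustration.

```python
def fit_region_rates(polish_times, removed_by_region):
    """Estimate the per-region removal rate of a single polishing step.

    polish_times[w]:         polishing time used on wafer w
    removed_by_region[w][j]: material removed on wafer w in region j
    Fits removed = rate * time for each region by least squares.
    """
    denominator = sum(t * t for t in polish_times)
    n_regions = len(removed_by_region[0])
    return [
        sum(t * removed[j] for t, removed in zip(polish_times, removed_by_region)) / denominator
        for j in range(n_regions)
    ]
```

Because superposition is assumed, rates fitted separately for each step can then be summed over the steps of a recipe.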

By way of example, a series of experiments may be conducted for a polishing system of interest
as described above for determining the relationship between wafer material removal rate profile and
polishing time and other parameters of interest. Standard polishing procedures may be used, with all
polishing pad and wafer conditions held constant with the exception of the parameter(s) under
investigation. Exemplary polishing parameters that may be held constant include polishing pad size,
polishing pad composition, wafer composition, pad conditioning time, rotational velocity of the
polishing pad, and rotational velocity of the wafer. In at least some embodiments of the present
invention, the parameters under investigation include at least the polishing time for each of the
polishing steps in the polishing recipe and the polishing down force (P), as defined by retaining ring
pressure. As is shown in greater detail in the analysis that follows, additional parameters may be
incorporated into the model.

Once data from DOE runs are obtained, a model may be developed. A model having five
polishing steps may be defined as follows:

$$\Delta R'_j = c_{1j}\,t_1 + c_{2j}\,t_2 + c_{3j}\,t_3 + c_{4j}\,t_4 + c_{5j}\,t_5 \tag{1}$$

where $\Delta R'_j$ is the amount of material removed for region j of the wafer; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are
the polishing times for polishing steps 1, 2, 3, 4, and 5, respectively; and $c_{1j}$, $c_{2j}$, $c_{3j}$, $c_{4j}$ and $c_{5j}$ are
removal rates for region j in polishing steps 1, 2, 3, 4, and 5, respectively.
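The time-only model is a sum of rate-times-time terms per region. A minimal sketch follows; the function name and the nested-list layout of the rate coefficients are hypothetical, and the code works for any number of steps rather than only five.

```python
def removal_per_region(rates, times):
    """Material removed in each region for a multi-step recipe.

    rates[a][j]: removal rate of polishing step a in region j
    times[a]:    polishing time of step a
    Returns a list whose entry j is the sum over steps of rates[a][j] * times[a].
    """
    n_regions = len(rates[0])
    return [
        sum(rates[a][j] * times[a] for a in range(len(times)))
        for j in range(n_regions)
    ]
```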

Additional parameters may be included in the model, and the model may be defined as follows:

$$\Delta R'_j = (c_{11j} x_1 + c_{12j})\,t_1 + (c_{21j} x_2 + c_{22j})\,t_2 + (c_{31j} x_3 + c_{32j})\,t_3 + (c_{41j} x_4 + c_{42j})\,t_4 + (c_{51j} x_5 + c_{52j})\,t_5 \tag{2}$$

where $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$ are the additional parameter values for polishing steps 1, 2, 3, 4, and
5, respectively; $t_1$, $t_2$, $t_3$, $t_4$ and $t_5$ are the polishing times for polishing steps 1, 2, 3, 4, and 5,
respectively; $c_{a1j}$ provides the contribution to wafer removal rate of the variable x in polishing step
a in region j; and $c_{a2j}$ provides the contribution to wafer removal rate of polishing time in polishing step
a. Thus, the model permits inclusion of an unlimited number of processing parameters.

In at least some embodiments of the present invention, the model may be further augmented to
include the effect of the tool state. The tool state represents the effect of wear, use and aging on the
tool, and includes the condition of the conditioning disk and polishing pad, represented by disk life and
pad life, and also includes idle time. This functionality may be expressed as a scaling factor. An
exemplary scaling factor that takes pad life and disk life into account is shown in eq. 3.

$$\Delta R_j = (1 + k_p t_p + k_d t_d + k_{pd}\, t_p t_d)\,\left(\Delta R'_j \big|_{t_p = 0,\; t_d = 0}\right) \tag{3}$$

The terms $t_p$ and $t_d$ refer to pad and disk life, respectively, with units of hour; and the terms $k_p$,
$k_d$ and $k_{pd}$ are empirically determined coefficients relating pad and disk life to removal rate, or
equivalently, to the amount of material removed. $\Delta R'_j$ at $t_p = 0$ and $t_d = 0$ is the amount of material
removed under initial polishing conditions.

Using a model such as the one just described, a prediction for region-averaged, final wafer
thickness can be calculated given incoming thickness, the pad and disk life, the polishing step times and
the value for any other selected parameter for those steps which vary that parameter.
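Such a prediction can be sketched by chaining three pieces: the fresh-tool removal with one additional parameter per step, the pad and disk life scaling factor, and subtraction from the incoming thickness. The scaling form 1 + k_p·t_p + k_d·t_d + k_pd·t_p·t_d is taken here as the reading of eq. 3; all function names, argument layouts and numeric values are hypothetical.

```python
def fresh_tool_removal(c1, c2, x, t):
    """Removal per region at zero pad and disk life.

    c1[a][j]: contribution of the additional parameter x[a] in step a, region j
    c2[a][j]: contribution of polishing time alone in step a, region j
    Entry j is the sum over steps of (c1[a][j] * x[a] + c2[a][j]) * t[a].
    """
    n_regions = len(c1[0])
    return [
        sum((c1[a][j] * x[a] + c2[a][j]) * t[a] for a in range(len(t)))
        for j in range(n_regions)
    ]


def tool_state_scale(pad_life, disk_life, k_p, k_d, k_pd):
    """Scaling factor for pad and disk life (both in hours)."""
    return 1.0 + k_p * pad_life + k_d * disk_life + k_pd * pad_life * disk_life


def predict_final_thickness(incoming, c1, c2, x, t, pad_life, disk_life, k_p, k_d, k_pd):
    """Predicted region-averaged thickness after polishing."""
    scale = tool_state_scale(pad_life, disk_life, k_p, k_d, k_pd)
    removal = fresh_tool_removal(c1, c2, x, t)
    return [h - scale * r for h, r in zip(incoming, removal)]
```

With negative life coefficients the scale falls below one, so a worn pad and disk remove less material than the fresh-tool model alone would predict.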

Process model development and optimization are carried out with reference to a specific
polishing system. That is, the conditions that affect within wafer uniformity are specific to the type of
wafer being polished, the slurry used in polishing and the composition of the polishing pad. Once a
wafer/slurry/polishing pad system is identified, the system is characterized using the models developed
according to the invention. In at least some embodiments of the present invention, it is envisioned that
a separate model (or at least a supplement to a composite model) is created for each slurry/polishing
pad/wafer combination (i.e., for each different type/brand of slurry and each type/brand of pad that may
be used in production with a given type of wafer).

Also, at least some embodiments of the present invention contemplate a wafer polishing model that can accommodate polishing at multiple platens, either in parallel or serially. The CMP process often consists of multiple platens, which are operated simultaneously. Typically, each platen removes a portion of the total amount of material to be removed. The wafers are advanced from platen to platen, and each platen has a separate recipe that determines the polishing step times and other processing parameters, such as retaining ring pressures for each of the steps that are performed on that platen.
A process model that accounts for the effects of multiple platens that perform similar or different polishing steps on wafer thickness profile is illustrated in FIG. 6. In a first phase 600 of the model, the polishing recipe 610 (here, 6 steps) for platen 1 620 is determined (the "first polishing process"). Process input data 630, such as incoming wafer thickness for the defined regions of the pre-polished wafer, disk life and pad life, are input into the model. The wafer is polished and final wafer thicknesses 640 for each of the wafer regions are measured. Post-polished region thicknesses 640 from the first polishing process are used as input data in a second phase 645 of the model development. A second polishing recipe 650 is carried out on platen 2 660, which can be the same as or different from that carried out on platen 1 620. Pad life and disk life factors 655 relating to the pad and conditioning disk used on platen 2 660 are also included in the model. Final thickness measurements 670 are taken and used in the model development. Thus, the method of the invention can accommodate a model that involves multiple polishing processes on multiple platens having different tool states and is able to provide platen-specific feedback 680 and 690 to platens 1 and 2, respectively. The model is extremely versatile and able to accommodate highly complex polishing scenarios.
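The serial, multi-platen prediction described above can be sketched as follows, under stated assumptions. The dictionary layout (`scale`, `params`, `times`, `coeffs`) is hypothetical, the per-step removal follows the form (c_a1j * x_a + c_a2j) * t_a used in the model, and `scale` stands in for each platen's own tool-state factor.

```python
def removal(region_coeffs, params, times):
    """Removal for one region: sum over steps of (c_a1j * x_a + c_a2j) * t_a."""
    return sum((c1 * x + c2) * t
               for (c1, c2), x, t in zip(region_coeffs, params, times))


def predict_through_platens(incoming, platens):
    """Feed the predicted post-polish thickness of each platen into the next.

    incoming: region-averaged incoming thicknesses, one value per region.
    platens:  list of dicts with hypothetical keys 'scale' (tool-state factor),
              'params' and 'times' (one entry per polishing step) and 'coeffs'
              (per region, a list of (c_a1j, c_a2j) pairs, one per step).
    """
    thickness = list(incoming)
    for platen in platens:
        thickness = [
            th - platen["scale"] * removal(region_coeffs,
                                           platen["params"], platen["times"])
            for th, region_coeffs in zip(thickness, platen["coeffs"])
        ]
    return thickness
```

Keeping one entry per platen mirrors the idea that each platen has its own recipe and its own pad and disk life, so feedback can later be directed at a single platen.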

According to at least some embodiments of the present invention, an initial model developed as described hereinabove is used to control the run-to-run uniformity of the polishing process and to provide a feedback loop for updating the polishing recipe (FIG. 5). Briefly, one or more wafers are processed according to a first polishing recipe. A thickness measurement is taken across the polished wafer to obtain a wafer thickness profile, which is compared to the predicted wafer thickness profile calculated by the model. If the measured wafer thickness profile indicates deviation from the desired results, those deviations are used in an optimization process to update the polishing recipe. The updated recipe is then used in a feedback loop to progressively optimize the polishing recipe so as to improve or maintain within wafer film thickness uniformity.

According to the processing flow diagram in FIG. 5, initial processing conditions, e.g., tool state and wafer state, are identified that will provide a desired wafer removal rate profile in step 500. The initial conditions may be determined empirically or by using the processing model of at least one embodiment of the present invention. If a processing model is used, a controller can use this model to calculate step times and processing parameters to polish an incoming profile to a target flat profile with a desired thickness as shown in step 510. Wafers are polished according to the initial polishing recipe in the CMP tool at step 520. The thickness of the polished wafer is measured and deviation from the predicted thickness is determined in step 530. In step 540 it is determined whether the deviation exceeds an established tolerance. If the deviation is within acceptable ranges, no changes are made to the polishing recipe and the controller instructs the tool to reuse the existing recipe in step 550. If the deviation is outside acceptable limits, new target parameters are set in step 560 and are fed back in step 570 into the controller where the polishing recipe is optimized according to an updated model that takes the deviation from the predicted value into consideration. The polishing step may be repeated and further updates of the polishing recipe are possible.
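The decision at steps 530 through 570 of FIG. 5 can be sketched as a single function. This is an illustrative outline only: `next_recipe` and the `reoptimize` callback are hypothetical names, and using the largest per-region deviation as the test against the tolerance is an assumption, since the text does not specify how the deviation is aggregated.

```python
def next_recipe(recipe, predicted, measured, tolerance, reoptimize):
    """Return (recipe_to_use, changed) for the next run.

    predicted, measured: per-region thicknesses after polishing (step 530).
    tolerance:           largest acceptable absolute deviation (step 540).
    reoptimize:          hypothetical callback (recipe, deviations) -> recipe
                         that stands in for the model-based optimization
                         of steps 560 and 570.
    """
    deviations = [m - p for m, p in zip(measured, predicted)]
    worst = max(abs(d) for d in deviations)
    if worst <= tolerance:
        return recipe, False  # step 550: reuse the existing recipe
    return reoptimize(recipe, deviations), True
```

Calling this once per processed wafer (or lot) gives the run-to-run loop: the returned recipe is the one sent to the tool for the next run.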

Process control of the CMP process according to at least one embodiment of the present invention permits optimization of the wafer removal rate for a series of regions j across the wafer surface. By individually optimizing for the regions j of the wafer, greater control over the total surface is attainable. Thus, greater within wafer uniformity is achieved.

An exemplary optimization method, which can be used in determining an updated model for determining an updated polishing recipe, solves the equation:

$$ \min_{x} f\left(y^{sp}, g(x)\right) \tag{4} $$

where x is a vector of times and other processing parameters corresponding to the polishing recipe; g(x) is the model for the CMP process as described above in eqs. 1-3; y^{sp} is a vector of the desired average region wafer thicknesses; and f(y^{sp}, g(x)) is some function which is meant to penalize the deviation between the model predictions g(x) and the desired thicknesses y^{sp}.
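As a rough illustration of this minimization, the sketch below searches a grid of candidate step times for the combination with the smallest penalty. Several things here are assumptions made for the example: the penalty is taken to be a sum of squared deviations, the model g(x) is reduced to time-only terms (one removal rate per region and step), and an exhaustive grid search replaces whatever solver an actual controller would use.

```python
from itertools import product


def predict(times, incoming, rates):
    """g(x): predicted final thickness per region for the given step times.

    rates[j][a] is the assumed removal rate of region j in polishing step a.
    """
    return [it - sum(r * t for r, t in zip(region_rates, times))
            for it, region_rates in zip(incoming, rates)]


def quadratic_penalty(target, predicted):
    """f(y, g(x)): sum of squared deviations from the desired thicknesses."""
    return sum((y - g) ** 2 for y, g in zip(target, predicted))


def optimize_times(incoming, rates, target, candidates):
    """Pick the step-time combination from the grid that minimizes the penalty."""
    n_steps = len(rates[0])
    best = min(product(candidates, repeat=n_steps),
               key=lambda times: quadratic_penalty(
                   target, predict(times, incoming, rates)))
    return list(best)
```

For example, with two regions, a uniform first step and an edge-fast second step, `optimize_times([1000, 1020], [[10, 5], [10, 7]], [500, 500], range(0, 61, 5))` selects the pair of step times that brings both regions to the target thickness.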

Thus, the optimization method suggests that the model need not correct for 100% of the deviation from predicted value. A penalty function may be used to reflect uncertainty in the measured or calculated parameters, or to "damp" the effect of changing parameters too quickly or to too great an extent. It is possible, for example, for the model to overcompensate for the measured deviations, thereby necessitating another adjustment to react to the overcompensation. This leads to an optimization process that is jumpy and takes several iterations before the optimized conditions are realized.
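One way such damping can arise from a penalty function is to add a term that penalizes movement away from the previous recipe. The sketch below assumes a single region, a single step time t, a linear model (final thickness = incoming - rate * t) and a quadratic penalty; the function name and the `move_weight` parameter are hypothetical. Minimizing (target - (incoming - rate*t))^2 + move_weight*(t - previous_time)^2 over t has the closed form used in the code.

```python
def damped_step_time(incoming, target, rate, previous_time, move_weight):
    """Step time minimizing a quadratic tracking penalty plus a move penalty.

    Setting the derivative with respect to t to zero gives
        t = (rate * (incoming - target) + move_weight * previous_time)
            / (rate**2 + move_weight)
    """
    return ((rate * (incoming - target) + move_weight * previous_time)
            / (rate ** 2 + move_weight))
```

With `move_weight = 0` the result fully corrects the deviation; larger weights keep the new time closer to the previous one, so only part of the deviation is corrected in a single run.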

Based upon this optimization method, the post-polishing wafer thickness is measured and the difference between the predicted thickness and the final thickness is determined. The error in prediction, also known as a bias, is then linearly added into the model such that the predicted final thickness more closely matches the actual final thickness. This bias is added to each region j, which is modeled as is shown in the following equation:

$$ FT_j = IT_j - AR_j + b_j \tag{5} $$

where FT_j is the predicted final thickness of region j; IT_j is the incoming thickness of region j; AR_j is the predicted amount which is removed from region j given a set of recipe parameters; and b_j is the bias term which arises due to the difference between the predicted and actual amount removed from region j. The process of linearly updating a model with bias terms based upon the difference between a model prediction and an actual measurement is part of at least some feedback controls in at least some embodiments of the present invention.
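The bias update of eq. 5 can be written out directly. In this minimal sketch the function names are illustrative, and the full prediction error is added to the bias in one step; a real controller might add only a fraction of it.

```python
def predict_final(incoming, removal, bias):
    """Eq. 5 per region: FT_j = IT_j - AR_j + b_j."""
    return [it - ar + b for it, ar, b in zip(incoming, removal, bias)]


def update_bias(bias, predicted_final, measured_final):
    """Add the prediction error of each region to that region's bias term."""
    return [b + (m - p)
            for b, p, m in zip(bias, predicted_final, measured_final)]
```

After the update, repeating the prediction with the same incoming thickness and removal reproduces the measured thickness, which is the sense in which the predicted final thickness is brought closer to the actual final thickness.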

In at least some embodiments of the present invention, a feedback control combines the platens into a single model using the average of the tool states for each of the platens. The single model would use the feedback approach described above to apportion the bias adjustment across the different platens in some predetermined way.
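The text leaves the apportioning rule open ("in some predetermined way"). One simple possibility, shown purely as an assumption, is to split the bias in proportion to fixed weights, for instance each platen's share of the total material removed.

```python
def apportion_bias(total_bias, weights):
    """Split one region's bias across platens in proportion to fixed weights.

    weights: one non-negative number per platen (hypothetical choice: the
             share of total removal performed on that platen).
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return [total_bias * w / total for w in weights]
```

The shares always add up to the original bias, so the combined single-model correction is preserved however it is divided.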

Also, in at least some embodiments of the present invention, a feedback control scheme uses the final thickness measurements to distribute feedback individually to all of the platens. A method for modeling a polishing process for which different platens are separately modeled and factored into the model is shown in FIG. 6. Because each platen can be treated individually, the tool state, e.g., the pad life and disk life, and idle time, can be included in the model and feedback can be specific to the platen and polishing recipe. This feedback control scheme is particularly useful when different polishing recipes are being carried out on each platen. The ability to separately model each platen provides a greater degree of processing flexibility, since it allows one to change the processing recipe at one platen (during one stage of the polishing process) while keeping the processing recipe at the remaining platens unchanged.

In at least some embodiments of the present invention, the controller applies feedback individually to each carrier head. Each carrier head performs in a unique manner and it is possible in updating the polishing recipe to separately review the past performance of each wafer carrier and to adjust the updated parameters accordingly.

Feedback and feedforward control algorithms are constructed for use in the above control process based on the above models using various methods. The algorithms may be used to optimize parameters using various methods, such as recursive parameter estimation. Recursive parameter estimation is used in situations such as these, where it is desirable to model on line at the same time as the input-output data is received. Recursive parameter estimation is well suited for making decisions on line, such as adaptive control or adaptive predictions. For more details about the algorithms and theories of identification, see Ljung L., System Identification - Theory for the User, Prentice Hall, Upper Saddle River, N.J., 2nd edition, 1999.
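A minimal sketch of recursive parameter estimation is the scalar recursive least squares update below. It assumes, for illustration only, that the removal in one region is proportional to polishing time (removal = theta * time) and estimates the single rate theta as each new measurement arrives. The function name and the forgetting factor parameter are not taken from the specification.

```python
def rls_update(theta, p, phi, y, forgetting=1.0):
    """One scalar recursive least squares step for the model y = theta * phi.

    theta: current parameter estimate (here, a removal rate).
    p:     current estimate variance (large when little is known).
    phi:   regressor for the new sample (here, polishing time).
    y:     measured output for the new sample (here, amount removed).
    forgetting: factor in (0, 1]; values below 1 discount older data.
    """
    gain = p * phi / (forgetting + phi * p * phi)
    theta_new = theta + gain * (y - phi * theta)
    p_new = (p - gain * phi * p) / forgetting
    return theta_new, p_new
```

Because each update needs only the previous estimate and the newest sample, the model can be refined on line as input-output data is received, without storing or refitting the full history.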

In at least some embodiments of the present invention, the polishing recipe may be updated in discrete increments or steps defined in the algorithms of the developed model. Also, in at least some embodiments of the present invention, the updated polishing recipes may be determined by interpolation to the appropriate parameters.
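Both update styles can be illustrated with two small helpers. These are sketches under assumptions: `snap_to_increment` rounds a computed value to the nearest allowed step, and `interpolate_time` linearly interpolates a step time from a hypothetical table of previously characterized (removal, time) pairs with strictly increasing removal values.

```python
import math


def snap_to_increment(value, increment):
    """Round value to the nearest multiple of increment (halves round up)."""
    return math.floor(value / increment + 0.5) * increment


def interpolate_time(removal_needed, table):
    """Linearly interpolate a step time from (removal, time) pairs.

    table must be sorted by removal, with strictly increasing removal values.
    """
    for (r0, t0), (r1, t1) in zip(table, table[1:]):
        if r0 <= removal_needed <= r1:
            return t0 + (t1 - t0) * (removal_needed - r0) / (r1 - r0)
    raise ValueError("removal_needed lies outside the tabulated range")
```

The two can be combined: interpolate to the ideal value first, then snap it to the increment the tool actually accepts.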

Additional apparatus utilized to implement the feedforward and feedback loop includes a film thickness measurement (metrology) tool to provide thickness data needed to calculate wafer material removal rate. The tool may be positioned on the polishing apparatus so as to provide in-line, in situ measurements, or it may be located remote from the polishing apparatus. The tool may use optical, electrical, acoustic or mechanical measurement methods. A suitable thickness measurement device is available from Nanometrics (Milpitas, CA) or Nova Measuring Instruments (Phoenix, AZ). A computer may be utilized to calculate the optimal pad conditioning recipe based upon the measured film thickness and calculated removal rate, employing the models and algorithm provided according to the invention. A suitable integrated controller and polishing apparatus (Mirra with iAPC or Mirra Mesa with iAPC) is available from Applied Materials, California.

Exemplary semiconductor wafers that can be polished using the concepts discussed herein include, but are not limited to, those made of silicon, tungsten, aluminum, copper, BPSG, USG, thermal oxide, silicon-related films, and low k dielectrics and mixtures thereof.

The invention may be practiced using any number of different types of conventional CMP polishing pads. There are numerous polishing pads in the art that are generally made of urethane or other polymers. Exemplary polishing pads include Epic™ polishing pads (Cabot Microelectronics Corporation, Aurora IL) and Rodel® IC1000, IC1010, IC1400 polishing pads (Rodel Corporation, Newark DE), OXP series polishing pads (Sycamore Pad), Thomas West Pad 711, 813, 815, 815-Ultra, 817, 826, 828, 828-E1 (Thomas West).

Furthermore, any number of different types of slurry can be used in conjunction with aspects of the present invention. There are numerous CMP polishing slurries in the art, which are generally made to polish specific types of metals in semiconductor wafers. Exemplary slurries include Semi-Sperse® (available as Semi-Sperse® 12, Semi-Sperse® 25, Semi-Sperse® D7000, Semi-Sperse® D7100, Semi-Sperse® D7300, Semi-Sperse® P1000, Semi-Sperse® W2000, and Semi-Sperse® W2585) (Cabot Microelectronics Corporation, Aurora IL), Rodel ILD1300, Klebesol series, Elexsol, MSW1500, MSW2000 series, CUS series and PTS (Rodel).

Various aspects of the present invention that can be controlled by a computer can be (and/or be controlled by) any number of control/computer entities, including the one shown in FIG. 7. Referring to FIG. 7, a bus 756 serves as the main information highway interconnecting the other components of system 711. CPU 758 is the central processing unit of the system, performing calculations and logic operations required to execute the processes of embodiments of the present invention as well as other programs. Read only memory (ROM) 760 and random access memory (RAM) 762 constitute the main memory of the system. Disk controller 764 interfaces one or more disk drives to the system bus 756. These disk drives are, for example, floppy disk drives 770, or CD ROM or DVD (digital video disks) drives 766, or internal or external hard drives 768. These various disk drives and disk controllers are optional devices.

A display interface 772 interfaces display 748 and permits information from the bus 756 to be displayed on display 748. Display 748 can be used in displaying a graphical user interface. Communications with external devices such as the other components of the system described above can occur utilizing, for example, communication port 774. Optical fibers and/or electrical cables and/or conductors and/or optical communication (e.g., infrared, and the like) and/or wireless communication (e.g., radio frequency (RF), and the like) can be used as the transport medium between the external devices and communication port 774. Peripheral interface 754 interfaces the keyboard 750 and mouse 752, permitting input data to be transmitted to bus 756. In addition to these components, system 711 also optionally includes an infrared transmitter and/or infrared receiver. Infrared transmitters are optionally utilized when the computer system is used in conjunction with one or more of the processing components/stations that transmits/receives data via infrared signal transmission. Instead of utilizing an infrared transmitter or infrared receiver, the computer system may also optionally use a low power radio transmitter 780 and/or a low power radio receiver 782. The low power radio transmitter transmits the signal for reception by components of the production process, and receives signals from the components via the low power radio receiver. The low power radio transmitter and/or receiver are standard devices in industry.

Although system 711 in FIG. 7 is illustrated having a single processor, a single hard disk drive and a single local memory, system 711 is optionally suitably equipped with any multitude or combination of processors or storage devices. For example, system 711 may be replaced by, or combined with, any suitable processing system operative in accordance with the principles of embodiments of the present invention, including sophisticated calculators, and hand-held, laptop/notebook, mini, mainframe and super computers, as well as processing system network combinations of the same.

FIG. 8 is an illustration of an exemplary computer readable memory medium 884 utilizable for storing computer readable code or instructions. As one example, medium 884 may be used with disk drives illustrated in FIG. 7. Typically, memory media such as floppy disks, or a CD ROM, or a digital video disk will contain, for example, a multi-byte locale for a single byte language and the program information for controlling the above system to enable the computer to perform the functions described herein. Alternatively, ROM 760 and/or RAM 762 illustrated in FIG. 7 can also be used to store the program information that is used to instruct the central processing unit 758 to perform the operations associated with the instant processes. Other examples of suitable computer readable media for storing information include magnetic, electronic, or optical (including holographic) storage, some combination thereof, etc. In addition, at least some embodiments of the present invention contemplate that the medium can be in the form of a transmission (e.g., digital or propagated signals).

In general, it should be emphasized that various components of embodiments of the present invention can be implemented in hardware, software or a combination thereof. In such embodiments, the various components and steps would be implemented in hardware and/or software to perform the functions of the present invention. Any presently available or future developed computer software language and/or hardware components can be employed in such embodiments of the present invention. For example, at least some of the functionality mentioned above could be implemented using the C, C++, or any assembly language appropriate in view of the processor(s) being used. It could also be written in an interpretive environment such as Java and transported to multiple destinations to various users.

Although various embodiments which incorporate the teachings of the present invention have been shown and described in detail herein, those skilled in the art can readily devise many other varied embodiments that incorporate these teachings.
What is claimed is:

1. A method of producing a uniform wafer thickness profile in a polishing operation, comprising:

(a) providing a model for a wafer polishing that defines a plurality of regions on a wafer and identifies a wafer material removal rate in a polishing step for each of the regions; and

(b) polishing a wafer using a polishing recipe that generates a target thickness profile for each region.

2. A method of controlling surface non-uniformity of a wafer in a polishing operation, comprising:

(a) providing a model for a wafer polishing that defines a plurality of regions on a wafer and identifies a wafer material removal rate in a polishing step of a polishing process for each of the regions, wherein the polishing process comprises a plurality of polishing steps;

(b) polishing a wafer using a first polishing recipe based upon an incoming wafer thickness profile;

(c) determining a wafer thickness profile for the post-polished wafer of step (b); and

(d) calculating an updated polishing recipe based upon the wafer thickness profile of step (c) and the model of step (a) to maintain a target wafer thickness profile.

3. The method of claim 2, wherein the first polishing recipe is based on the model of step (a) to obtain the target wafer thickness profile.

4. The method of claim 2, wherein the first polishing recipe is determined empirically.

5. The method of claim 1 or 2, wherein the plurality of regions in the model of step (a) comprises regions extending radially outward from a center point on the wafer.

6. The method of claim 5, wherein the model comprises four or more regions.

7. The method of claim 1, wherein the polishing of step (b) comprises polishing the wafer at a plurality of polishing stations.

8. The method of claim 7, wherein the polishing step is carried out at three polishing stations.



9. The method of claim 7, wherein the polishing recipe at at least two polishing stations is the same.

10. The method of claim 7, wherein the polishing recipe at at least two polishing stations is different.

11. The method of claim 7, wherein the calculating the updated polishing recipe of step (c) comprises calculating updated polishing recipes for each of the plurality of polishing stations.

12. The method of claim 11, wherein the updated polishing recipes for each of the plurality of polishing stations account for the tool state of the individual polishing stations.

13. The method of claim 9 or 10, wherein the polishing of step (b) is carried out at a plurality of polishing stations, and wherein the wafer thickness profile for each of the subsequent polishing stations is provided by the prediction from previous stations.

14. The method of claim 1 or 2, wherein the step of providing a model comprises:

(e) measuring pre-polished wafer thickness in each of a plurality of regions defined on one or more wafers;

(f) polishing the one or more wafers, wherein polishing comprises polishing the one or more wafers in a plurality of polishing steps;

(g) measuring the wafer material removal rate for the one or more wafers at each of the plurality of regions after each of the polishing steps of step (g);

(h) providing a model defining the effect of tool state on polishing effectiveness; and

(i) recording the pre-polished and post-polished wafer thicknesses for each of the regions on a recordable medium.

15. The method of claim 14, further comprising:

fitting the data to a linear or non-linear curve that establishes a relationship between the material removal rate of a region of the wafer and a polishing parameter of interest.

16. The method of claim 14, wherein the polishing parameter comprises polishing time.

17. The method of claim 16, wherein the polishing parameters further comprise a parameter selected from the group consisting of polishing time, polishing pad down forces and velocity, slurry flow and composition, conditioning time, conditioning disk down forces and velocity, oscillating speeds of both the conditioning disk and the wafer carrier.
18. The method of claim 1 or 2, wherein the wafer removal for a region j in the model of step (a) is determined according to the equation:

$$ AR'_j = (c_{11j} \cdot x_1 + c_{12j}) \cdot t_1 + (c_{21j} \cdot x_2 + c_{22j}) \cdot t_2 + (c_{31j} \cdot x_3 + c_{32j}) \cdot t_3 + (c_{41j} \cdot x_4 + c_{42j}) \cdot t_4 + (c_{51j} \cdot x_5 + c_{52j}) \cdot t_5 $$

where x_1, x_2, x_3, x_4 and x_5 are the additional parameter values for polishing steps 1, 2, 3, 4, and 5, respectively; t_1, t_2, t_3, t_4 and t_5 are the polishing times for polishing steps 1, 2, 3, 4, and 5, respectively, and c_{a1j} provides the contribution to wafer removal of the variable x in polishing step a in region j; and c_{a2j} provides the contribution to wafer removal of polishing time in polishing step a.

19. The method of claim 18 wherein the wafer material removal rate profile accounts for tool state by scaling the profile using the scaling factor:

$$ (1 + k_p \cdot t_p + k_d \cdot t_d + k_{pd} \cdot t_p \cdot t_d) $$

where the terms t_p and t_d refer to pad and disk life, respectively, with units of hour; and the terms k_p, k_d and k_pd are empirically determined coefficients relating pad and disk life to removal rate.

20. The method of claim 2, wherein the updated polishing recipe is attained by solving the equation:

$$ \min_{x} f\left(y^{sp}, g(x)\right) $$

where x is a vector of times and other processing parameters corresponding to the polishing recipe; g(x) is the model for the polishing process; y^{sp} is a vector of the desired average region wafer thicknesses; and f(y^{sp}, g(x)) is a penalty function to penalize the deviation between the model predictions g(x) and the desired thicknesses y^{sp}.

21. A method of determining a model for wafer thickness profile, comprising:

(a) measuring pre-polished wafer thickness in each of a plurality of regions defined on one or more wafers;

(b) polishing the one or more wafers, wherein polishing comprises polishing the one or more wafers in a plurality of polishing steps;

(c) measuring the wafer material removal rate for the one or more wafers at each of the plurality of regions after each of the polishing steps of step (b);

(d) providing a model defining the effect of tool state on polishing effectiveness; and

(e) recording the pre-polished and post-polished wafer thicknesses for each of the regions on a recordable medium.

22. The method of claim 21, further comprising:

fitting the data to a linear or non-linear curve that establishes a relationship between the material removal rate of a region of the wafer and a polishing parameter of interest.
23. The method of claim 22, wherein the polishing parameter comprises polishing time.

24. The method of claim 23, wherein the polishing parameters further comprise a parameter selected from the group consisting of polishing time, polishing pad down forces and velocity, slurry flow and composition, conditioning time, conditioning disk down forces and velocity, oscillating speeds of both the conditioning disk and the wafer carrier.

25. The method of claim 21 wherein the wafer material removal for a region j (AR'_j) in the model of step (a) is determined according to the equation:

$$ AR'_j = (c_{11j} \cdot x_1 + c_{12j}) \cdot t_1 + (c_{21j} \cdot x_2 + c_{22j}) \cdot t_2 + (c_{31j} \cdot x_3 + c_{32j}) \cdot t_3 + (c_{41j} \cdot x_4 + c_{42j}) \cdot t_4 + (c_{51j} \cdot x_5 + c_{52j}) \cdot t_5 $$

where x_1, x_2, x_3, x_4, and x_5 are the additional parameter values for polishing steps 1, 2, 3, 4, and 5, respectively; t_1, t_2, t_3, t_4 and t_5 are the polishing times for polishing steps 1, 2, 3, 4, and 5, respectively, and c_{a1j} provides the contribution to wafer removal of the variable x in polishing step a in region j; and c_{a2j} provides the contribution to wafer removal of polishing time in polishing step a.

26. The method of claim 21 wherein the wafer material removal rate profile accounts for tool state by scaling the profile using the scaling factor

$$ (1 + k_p \cdot t_p + k_d \cdot t_d + k_{pd} \cdot t_p \cdot t_d) $$

where the terms t_p and t_d refer to pad and disk life, respectively, with units of hour, and the terms k_p, k_d and k_pd are empirically determined coefficients relating pad and disk life to removal rate.






27. The method of claim 22, wherein the model is determined using less than 10 wafers.



28. An apparatus for conditioning polishing pads used to planarize substrates, comprising:

a carrier assembly having a plurality of arms for holding a wafer positionable over a plurality of planarizing surfaces of a plurality of polishing pads;

controlling means capable of controlling an operating parameter of the polishing process; and

a controller operatively coupled to the controlling means, the controller operating the controlling means to adjust the operating parameter of the polishing process as a function of a model for a wafer thickness profile, the model comprising:

defining a polishing model that defines a plurality of regions on a wafer and identifies a wafer material removal rate in a polishing step of a polishing process for each of the regions, wherein the polishing process comprises a plurality of polishing steps.
29. The apparatus of claim 28, wherein the wafer removal for a region j (AR'_j) in the wafer material removal rate model is determined according to the equation:

$$ AR'_j = (c_{11j} \cdot x_1 + c_{12j}) \cdot t_1 + (c_{21j} \cdot x_2 + c_{22j}) \cdot t_2 + (c_{31j} \cdot x_3 + c_{32j}) \cdot t_3 + (c_{41j} \cdot x_4 + c_{42j}) \cdot t_4 + (c_{51j} \cdot x_5 + c_{52j}) \cdot t_5 $$

where x_1, x_2, x_3, x_4 and x_5 are the additional parameter values for polishing steps 1, 2, 3, 4, and 5, respectively; t_1, t_2, t_3, t_4 and t_5 are the polishing times for polishing steps 1, 2, 3, 4, and 5, respectively, and c_{a1j} provides the contribution to wafer removal of the variable x in polishing step a in region j; and c_{a2j} provides the contribution to wafer removal of polishing time in polishing step a.

30. A computer readable medium comprising instructions being executed by a computer, the instructions including a computer-implemented software application for a chemical mechanical polishing process, the instructions for implementing the process comprising:

(a) receiving data from a chemical mechanical polishing tool relating to the wafer removal rate of at least one wafer processed in the chemical mechanical polishing process; and

(b) calculating, from the data of step (a), updated polishing recipe, wherein the updated polishing recipe is calculated by determining the difference between an output of a wafer material removal rate model and the data of step (a).

31. The medium of claim 28, wherein the model for a wafer material removal rate defines a plurality of regions on a wafer and identifies a wafer material removal rate in a polishing step of a polishing process for each of the regions, wherein the polishing process comprises a plurality of polishing steps.

32. The medium of claim 30, wherein the wafer removal for a region j (AR'_j) in the wafer material removal rate model is determined according to the equation:

$$ AR'_j = (c_{11j} \cdot x_1 + c_{12j}) \cdot t_1 + (c_{21j} \cdot x_2 + c_{22j}) \cdot t_2 + (c_{31j} \cdot x_3 + c_{32j}) \cdot t_3 + (c_{41j} \cdot x_4 + c_{42j}) \cdot t_4 + (c_{51j} \cdot x_5 + c_{52j}) \cdot t_5 $$

where x_1, x_2, x_3, x_4 and x_5 are the additional parameter values for polishing steps 1, 2, 3, 4, and 5, respectively; t_1, t_2, t_3, t_4 and t_5 are the polishing times for polishing steps 1, 2, 3, 4, and 5, respectively, and c_{a1j} provides the contribution to wafer removal of the variable x in polishing step a in region j; and c_{a2j} provides the contribution to wafer removal of polishing time in polishing step a.